Sampling Strategies for Finite Population Using Auxiliary Information


Abstract 


This paper deals with the problem of estimating the finite population mean when
information on an auxiliary attribute is available. It is shown that the proposed estimator is more
efficient than the usual mean estimator and other existing estimators. The results are
illustrated numerically using an empirical population considered in the literature.


Keywords Simple random sampling, auxiliary attribute, point bi-serial correlation, ratio 
estimator, efficiency. 


1. Introduction 


The use of auxiliary information can increase the precision of an estimator when the
study variable y is highly correlated with the auxiliary variable x. In many situations the
auxiliary information is available in the form of an attribute, e.g. the sex of a person when
the study variable is height, the breed of a cow when it is the amount of milk produced, or
the variety of wheat when it is the yield of the wheat crop (see Jhajj et al. (2006)).

Consider a sample of size n drawn by simple random sampling without replacement
(SRSWOR) from a population of size N. Let $y_i$ and $\phi_i$ denote the observations on the
variable y and the attribute $\phi$, respectively, for the i-th unit (i = 1, 2, ..., N), where
$\phi_i = 1$ if the i-th unit of the population possesses the attribute $\phi$ and $\phi_i = 0$ otherwise.


Let $A = \sum_{i=1}^{N} \phi_i$ and $a = \sum_{i=1}^{n} \phi_i$ denote the total number of units in the
population and in the sample, respectively, possessing the attribute $\phi$. Let $P = A/N$ and
$p = a/n$ denote the proportion of units in the population and in the sample, respectively,
possessing the attribute $\phi$. Naik and Gupta (1996) introduced a ratio estimator $t_{NG}$ for
the case where the study variable and the auxiliary attribute are positively correlated.
The estimator $t_{NG}$ is given by

$$ t_{NG} = \bar{y}\,\frac{P}{p} \tag{1.1} $$
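with MSE

$$ MSE(t_{NG}) = f_1\left(S_y^2 + R^2 S_\phi^2 - 2R\,S_{y\phi}\right) \tag{1.2} $$

where

$$ f_1 = \frac{N-n}{Nn}, \qquad R = \frac{\bar{Y}}{P}, \qquad S_y^2 = \frac{1}{N-1}\sum_{i=1}^{N}(y_i-\bar{Y})^2, $$

$$ S_\phi^2 = \frac{1}{N-1}\sum_{i=1}^{N}(\phi_i-P)^2, \qquad S_{y\phi} = \frac{1}{N-1}\sum_{i=1}^{N}(\phi_i-P)(y_i-\bar{Y}). $$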

(for details see Singh et al. (2008)) 
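The ratio estimator (1.1) and its first-order MSE (1.2) can be sketched in a few lines of Python. This is only an illustration: the function names and the toy population are hypothetical and do not come from the paper.

```python
from statistics import mean


def naik_gupta_ratio(y_sample, phi_sample, P):
    """Ratio estimator t_NG = ybar * P / p, where p is the sample proportion."""
    p = mean(phi_sample)
    if p == 0:
        raise ValueError("no sampled unit possesses the attribute, so p = 0")
    return mean(y_sample) * P / p


def mse_naik_gupta(y, phi, n):
    """First-order MSE (1.2), computed from the full population (y, phi)."""
    N = len(y)
    Ybar, P = mean(y), mean(phi)
    f1 = (N - n) / (N * n)
    R = Ybar / P
    s_y2 = sum((yi - Ybar) ** 2 for yi in y) / (N - 1)
    s_phi2 = sum((ph - P) ** 2 for ph in phi) / (N - 1)
    s_yphi = sum((ph - P) * (yi - Ybar) for yi, ph in zip(y, phi)) / (N - 1)
    return f1 * (s_y2 + R ** 2 * s_phi2 - 2 * R * s_yphi)


# Toy population (made-up numbers, for illustration only).
y_pop = [1, 2, 3, 4]
phi_pop = [0, 0, 1, 1]
print(mse_naik_gupta(y_pop, phi_pop, n=2))
print(naik_gupta_ratio([4, 8], [1, 1], P=0.5))
```

The estimator is undefined when the sample contains no unit with the attribute (p = 0), which the sketch reports as an error rather than hiding.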


Jhajj et al. (2006) suggested a family of estimators for the population mean in single and two
phase sampling when the study variable and the auxiliary attribute are positively correlated.
Shabbir and Gupta (2007), Singh et al. (2008) and Abd-Elfattah et al. (2010) have
considered the problem of estimating the population mean $\bar{Y}$ taking into consideration the point
biserial correlation coefficient between the auxiliary attribute and the study variable.


The objective of this article is to suggest a generalised class of estimators for the population
mean $\bar{Y}$ and to analyse its properties. A numerical illustration is given in support of the
present study.


2. Proposed Estimator 


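Let $\psi_i = \phi_i + mA$, m being a suitably chosen scalar that takes the values 0 and 1. Then

$$ q = p + mA = p + NmP \qquad\text{and}\qquad Q = (Nm+1)P, $$

where $Q = \frac{1}{N}\sum_{i=1}^{N}\psi_i$ and $q = \frac{1}{n}\sum_{i=1}^{n}\psi_i$ are the population and sample means of $\psi_i$.

Motivated by Bedi (1996), we define a family of estimators for the population mean $\bar{Y}$ as

$$ t = \left[w_1\bar{y} + w_2\, b\,(P-p)\right]\left(\frac{q}{Q}\right)^{\alpha} \tag{2.1} $$

where $w_1$, $w_2$ and $\alpha$ are suitably chosen scalars and b is the sample regression coefficient of y on $\phi$.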
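A minimal Python sketch of the family may help fix ideas. It assumes that (2.1) reads $t = [w_1\bar{y} + w_2 b(P-p)](q/Q)^{\alpha}$ with $q = p + NmP$, $Q = (Nm+1)P$ and b the sample regression coefficient $s_{y\phi}/s_\phi^2$. The function name, argument names and sample values are hypothetical.

```python
from statistics import mean


def family_estimator(y_s, phi_s, P, N, w1=1.0, w2=0.0, alpha=0.0, m=0):
    """Evaluate t = [w1*ybar + w2*b*(P - p)] * (q/Q)**alpha on one sample."""
    n = len(y_s)
    ybar, p = mean(y_s), mean(phi_s)

    b = 0.0
    if w2 != 0:
        # Sample regression coefficient b = s_yphi / s_phi^2.
        s_phi2 = sum((ph - p) ** 2 for ph in phi_s) / (n - 1)
        s_yphi = sum((ph - p) * (yi - ybar) for yi, ph in zip(y_s, phi_s)) / (n - 1)
        if s_phi2 == 0:
            raise ValueError("attribute is constant in the sample, b is undefined")
        b = s_yphi / s_phi2

    q = p + m * N * P          # sample mean of psi_i = phi_i + m*A, with A = N*P
    Q = (N * m + 1) * P        # population mean of psi_i
    if q == 0 and alpha < 0:
        raise ValueError("q = 0 with a negative exponent")
    return (w1 * ybar + w2 * b * (P - p)) * (q / Q) ** alpha


# Hypothetical sample of four units from a population with N = 20, P = 0.25.
y_s, phi_s = [2, 4, 6, 8], [0, 1, 0, 1]
sample_mean = family_estimator(y_s, phi_s, P=0.25, N=20)
ratio_type = family_estimator(y_s, phi_s, P=0.25, N=20, alpha=-1)
regression_type = family_estimator(y_s, phi_s, P=0.25, N=20, w2=1.0)
print(sample_mean, ratio_type, regression_type)
```

Under this reading, the choice $w_1 = 1$, $w_2 = 0$, $\alpha = -1$, $m = 0$ reduces to the ratio estimator $\bar{y}P/p$, and $w_1 = w_2 = 1$, $\alpha = 0$ to the regression estimator $\bar{y} + b(P-p)$.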


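To obtain the bias and MSE of the estimator t, we write

$$ \bar{y} = \bar{Y}(1+e_0), \quad p = P(1+e_1), \quad s_\phi^2 = S_\phi^2(1+e_2), \quad s_{y\phi} = S_{y\phi}(1+e_3), \quad b = B(1+e_3)(1+e_2)^{-1}, $$

such that $E(e_i) = 0$, i = 0, 1, 2, 3, and

$$ E(e_0^2) = \left(\frac{1}{n}-\frac{1}{N}\right)C_y^2, \qquad E(e_1^2) = \left(\frac{1}{n}-\frac{1}{N}\right)C_p^2, $$

$$ E(e_0e_1) = \left(\frac{1}{n}-\frac{1}{N}\right)\rho_{pb}C_yC_p, \qquad E(e_1e_2) = \left(\frac{1}{n}-\frac{1}{N}\right)C_p\lambda_{03}, $$

$$ E(e_1e_3) = \left(\frac{1}{n}-\frac{1}{N}\right)C_p\frac{\lambda_{12}}{\rho_{pb}}. $$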
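The second-order moments of $e_0$ and $e_1$ can be checked numerically. The sketch below enumerates every SRSWOR sample from a small made-up population and compares the averages of $e_0^2$, $e_1^2$ and $e_0e_1$ with $f_1C_y^2$, $f_1C_p^2$ and $f_1\rho_{pb}C_yC_p$, where $f_1 = 1/n - 1/N$ and $\rho_{pb}C_yC_p = S_{y\phi}/(\bar{Y}P)$. Function names and data are hypothetical.

```python
from itertools import combinations


def mean(values):
    values = list(values)
    return sum(values) / len(values)


def exact_error_moments(y, phi, n):
    """Average e0^2, e1^2 and e0*e1 over all SRSWOR samples of size n."""
    N = len(y)
    Ybar, P = mean(y), mean(phi)
    samples = list(combinations(range(N), n))
    e0e0 = e1e1 = e0e1 = 0.0
    for idx in samples:
        e0 = mean(y[i] for i in idx) / Ybar - 1.0
        e1 = mean(phi[i] for i in idx) / P - 1.0
        e0e0 += e0 * e0
        e1e1 += e1 * e1
        e0e1 += e0 * e1
    k = len(samples)
    return e0e0 / k, e1e1 / k, e0e1 / k


def formula_moments(y, phi, n):
    """f1*C_y^2, f1*C_p^2 and f1*rho_pb*C_y*C_p from population quantities."""
    N = len(y)
    Ybar, P = mean(y), mean(phi)
    f1 = 1.0 / n - 1.0 / N
    s_y2 = sum((yi - Ybar) ** 2 for yi in y) / (N - 1)
    s_phi2 = sum((ph - P) ** 2 for ph in phi) / (N - 1)
    s_yphi = sum((ph - P) * (yi - Ybar) for yi, ph in zip(y, phi)) / (N - 1)
    return f1 * s_y2 / Ybar ** 2, f1 * s_phi2 / P ** 2, f1 * s_yphi / (Ybar * P)


# Made-up population of eight units; all 56 samples of size 3 are enumerated.
y_pop = [1, 2, 3, 4, 5, 6, 7, 9]
phi_pop = [0, 0, 0, 1, 0, 1, 1, 1]
print(exact_error_moments(y_pop, phi_pop, 3))
print(formula_moments(y_pop, phi_pop, 3))
```

For these three moments the agreement is exact under SRSWOR, not just approximate, because they involve only the variances and covariance of sample means. The moments involving $e_2$ and $e_3$ are first-order approximations and are not checked here.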


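Expressing (2.1) in terms of the e's, and writing $R = \bar{Y}/P$, we have

$$ t = \bar{Y}\left[w_1(1+e_0) - w_2\frac{B}{R}\,e_1(1+e_3)(1+e_2)^{-1}\right]\left(1+\frac{e_1}{Nm+1}\right)^{\alpha} \tag{2.2} $$

We assume that $|e_2| < 1$ and $\left|\frac{e_1}{Nm+1}\right| < 1$, so that $(1+e_2)^{-1}$ and $\left(1+\frac{e_1}{Nm+1}\right)^{\alpha}$ are expandable.

Expanding the right-hand side of (2.2) and retaining terms up to the second power of the e's, we
have

$$ t-\bar{Y} = \bar{Y}\left[w_1\left\{1 + e_0 + \frac{\alpha e_1}{Nm+1} + \frac{\alpha(\alpha-1)e_1^2}{2(Nm+1)^2} + \frac{\alpha e_0e_1}{Nm+1}\right\} - w_2\frac{B}{R}\left\{e_1 + e_1e_3 - e_1e_2 + \frac{\alpha e_1^2}{Nm+1}\right\} - 1\right] \tag{2.3} $$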


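Taking expectations of both sides of (2.3), we get the bias of t to the first degree of
approximation as

$$ B(t) = \bar{Y}\left[w_1\left\{1 + f_1\left(\frac{\alpha\rho_{pb}C_yC_p}{Nm+1} + \frac{\alpha(\alpha-1)C_p^2}{2(Nm+1)^2}\right)\right\} - w_2\frac{B}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03} + \frac{\alpha C_p^2}{Nm+1}\right\} - 1\right] \tag{2.4} $$

where $f_1 = \frac{1}{n}-\frac{1}{N}$.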


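Squaring both sides of (2.3) and neglecting terms in the e's of power greater than two, we
have

$$
\begin{aligned}
(t-\bar{Y})^2 = \bar{Y}^2\Big[\,1 &+ w_1^2\Big\{1 + 2e_0 + e_0^2 + \frac{2\alpha e_1}{Nm+1} + \frac{\alpha(2\alpha-1)e_1^2}{(Nm+1)^2} + \frac{4\alpha e_0e_1}{Nm+1}\Big\} + w_2^2\Big(\frac{B}{R}\Big)^2 e_1^2 \\
&- 2w_1w_2\frac{B}{R}\Big\{e_1 + e_0e_1 + e_1e_3 - e_1e_2 + \frac{2\alpha e_1^2}{Nm+1}\Big\} \\
&- 2w_1\Big\{1 + e_0 + \frac{\alpha e_1}{Nm+1} + \frac{\alpha e_0e_1}{Nm+1} + \frac{\alpha(\alpha-1)e_1^2}{2(Nm+1)^2}\Big\} \\
&+ 2w_2\frac{B}{R}\Big\{e_1 + e_1e_3 - e_1e_2 + \frac{\alpha e_1^2}{Nm+1}\Big\}\Big]
\end{aligned}
\tag{2.5}
$$

Taking expectations of both sides of (2.5), we get the MSE of t to the first degree of
approximation as

$$ MSE(t) = \bar{Y}^2\left[1 + w_1^2A_{1(\alpha)}^{(m)} + w_2^2A_2 - 2w_1w_2A_{3(\alpha)}^{(m)} - 2w_1A_{4(\alpha)}^{(m)} + 2w_2A_{5(\alpha)}^{(m)}\right] \tag{2.6} $$

where

$$ A_{1(\alpha)}^{(m)} = 1 + f_1\left[C_y^2 + \frac{\alpha C_p^2}{Nm+1}\left\{\frac{2\alpha-1}{Nm+1} + 4k\right\}\right], $$

$$ A_2 = \left(\frac{B}{R}\right)^2 f_1C_p^2, $$

$$ A_{3(\alpha)}^{(m)} = \frac{B}{R}f_1\left[C_p^2\left(k + \frac{2\alpha}{Nm+1}\right) + C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right], $$

$$ A_{4(\alpha)}^{(m)} = 1 + f_1\frac{\alpha C_p^2}{Nm+1}\left[k + \frac{\alpha-1}{2(Nm+1)}\right], $$

$$ A_{5(\alpha)}^{(m)} = \frac{B}{R}f_1\left[\frac{\alpha C_p^2}{Nm+1} + C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right], $$

and $k = \rho_{pb}\frac{C_y}{C_p}$.

The MSE(t) is minimised for

$$ w_1 = \frac{A_2A_{4(\alpha)}^{(m)} - A_{3(\alpha)}^{(m)}A_{5(\alpha)}^{(m)}}{A_{1(\alpha)}^{(m)}A_2 - \left(A_{3(\alpha)}^{(m)}\right)^2} = w_1^{*}, \qquad w_2 = \frac{A_{3(\alpha)}^{(m)}A_{4(\alpha)}^{(m)} - A_{1(\alpha)}^{(m)}A_{5(\alpha)}^{(m)}}{A_{1(\alpha)}^{(m)}A_2 - \left(A_{3(\alpha)}^{(m)}\right)^2} = w_2^{*} \tag{2.7} $$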
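The minimisation can be checked numerically. The sketch below treats MSE(t)/$\bar{Y}^2$ as the quadratic $1 + w_1^2A_1 + w_2^2A_2 - 2w_1w_2A_3 - 2w_1A_4 + 2w_2A_5$ in $(w_1, w_2)$, solves the two normal equations in closed form, and confirms that nearby weights give a larger value. The A values are arbitrary illustrative numbers, not values from the paper, and the function names are hypothetical.

```python
def relative_mse(w1, w2, A1, A2, A3, A4, A5):
    """MSE(t) / Ybar^2 as a quadratic function of the weights."""
    return (1 + w1 ** 2 * A1 + w2 ** 2 * A2
            - 2 * w1 * w2 * A3 - 2 * w1 * A4 + 2 * w2 * A5)


def optimal_weights(A1, A2, A3, A4, A5):
    """Solve w1*A1 - w2*A3 = A4 and w1*A3 - w2*A2 = A5 for (w1, w2)."""
    det = A1 * A2 - A3 ** 2
    if det == 0:
        raise ValueError("A1*A2 - A3^2 = 0, the minimiser is not unique")
    w1 = (A2 * A4 - A3 * A5) / det
    w2 = (A3 * A4 - A1 * A5) / det
    return w1, w2


def minimum_relative_mse(A1, A2, A3, A4, A5):
    """Closed-form minimum: 1 - (A2*A4^2 - 2*A3*A4*A5 + A1*A5^2) / det."""
    det = A1 * A2 - A3 ** 2
    return 1 - (A2 * A4 ** 2 - 2 * A3 * A4 * A5 + A1 * A5 ** 2) / det


# Arbitrary illustrative values with A1 > 0 and A1*A2 - A3^2 > 0.
A = dict(A1=1.5, A2=0.5, A3=0.2, A4=1.1, A5=0.1)
w1_opt, w2_opt = optimal_weights(**A)
at_optimum = relative_mse(w1_opt, w2_opt, **A)
print(w1_opt, w2_opt, at_optimum, minimum_relative_mse(**A))
```

The stationary point is a minimum only when $A_1 > 0$ and $A_1A_2 - A_3^2 > 0$, which the sketch assumes rather than verifies.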
3. Members of the family of estimators t and their biases and MSEs


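Table 3.1: Different members of the family of estimators t

Choice of scalars                      Estimator
$w_1$   $w_2$   $\alpha$   m
1       0       0          0           $t_1 = \bar{y}$
$w_1$   0       0          0           $t_2 = w_1\bar{y}$  (Searls (1964) type estimator)
$w_1$   0       $\alpha$   m           $t_3 = w_1\bar{y}\,(q/Q)^{\alpha}$
$w_1$   0       $\alpha$   0           $t_4 = w_1\bar{y}\,(p/P)^{\alpha}$
1       0       -1         0           $t_5 = \bar{y}\,P/p$  (Naik and Gupta (1996) estimator)
1       1       -1         0           $t_6 = [\bar{y} + b(P-p)]\,P/p$  (Singh et al. (2008) estimator)
$w_1$   $w_2$   0          0           $t_7 = w_1\bar{y} + w_2 b(P-p)$
$w_1$   1       0          0           $t_8 = w_1\bar{y} + b(P-p)$
$w_1$   $w_1$   0          0           $t_9 = w_1[\bar{y} + b(P-p)]$
1       1       0          0           $t_{10} = \bar{y} + b(P-p)$  (Regression estimator)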


The estimator $t_1 = \bar{y}$ is an unbiased estimator of the population mean $\bar{Y}$ and has the variance

$$ Var(t_1) = f_1S_y^2 \tag{3.1} $$
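To the first degree of approximation, the biases and MSEs of the $t_i$, i = 2, 3, ..., 10, are
respectively given by

$$ B(t_2) = \bar{Y}(w_1-1) \tag{3.2} $$

$$ B(t_3) = \bar{Y}\left[(w_1-1) + w_1f_1\left\{\frac{\alpha\rho_{pb}C_yC_p}{Nm+1} + \frac{\alpha(\alpha-1)C_p^2}{2(Nm+1)^2}\right\}\right] \tag{3.3} $$

$$ B(t_4) = \bar{Y}\left[(w_1-1) + w_1f_1\left\{\alpha\rho_{pb}C_yC_p + \frac{\alpha(\alpha-1)}{2}C_p^2\right\}\right] \tag{3.4} $$

$$ B(t_5) = \bar{Y}f_1\left[C_p^2 - \rho_{pb}C_yC_p\right] \tag{3.5} $$

$$ B(t_6) = \bar{Y}f_1\left[\left(C_p^2 - \rho_{pb}C_yC_p\right) - \frac{B}{R}\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03} - C_p^2\right\}\right] \tag{3.6} $$

$$ B(t_7) = \bar{Y}\left[(w_1-1) - w_2\frac{B}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\}\right] \tag{3.7} $$

$$ B(t_8) = \bar{Y}\left[(w_1-1) - \frac{B}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\}\right] \tag{3.8} $$

$$ B(t_9) = \bar{Y}\left[(w_1-1) - w_1\frac{B}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\}\right] \tag{3.9} $$

$$ B(t_{10}) = -\bar{Y}\frac{B}{R}f_1\left\{C_p\frac{\lambda_{12}}{\rho_{pb}} - C_p\lambda_{03}\right\} \tag{3.10} $$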
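The corresponding MSEs are

$$ MSE(t_2) = \bar{Y}^2\left[1 + w_1^2A_{1(0)}^{(0)} - 2w_1A_{4(0)}^{(0)}\right] \tag{3.11} $$

$$ MSE(t_3) = \bar{Y}^2\left[1 + w_1^2A_{1(\alpha)}^{(m)} - 2w_1A_{4(\alpha)}^{(m)}\right] \tag{3.12} $$

$$ MSE(t_4) = \bar{Y}^2\left[1 + w_1^2A_{1(\alpha)}^{(0)} - 2w_1A_{4(\alpha)}^{(0)}\right] \tag{3.13} $$

$$ MSE(t_5) = \bar{Y}^2\left[1 + A_{1(-1)}^{(0)} - 2A_{4(-1)}^{(0)}\right] \tag{3.14} $$

$$ MSE(t_6) = \bar{Y}^2\left[1 + A_{1(-1)}^{(0)} + A_2 - 2A_{3(-1)}^{(0)} - 2A_{4(-1)}^{(0)} + 2A_{5(-1)}^{(0)}\right] \tag{3.15} $$

$$ MSE(t_7) = \bar{Y}^2\left[1 + w_1^2A_{1(0)}^{(0)} + w_2^2A_2 - 2w_1w_2A_{3(0)}^{(0)} - 2w_1A_{4(0)}^{(0)} + 2w_2A_{5(0)}^{(0)}\right] \tag{3.16} $$

$$ MSE(t_8) = \bar{Y}^2\left[1 + w_1^2A_{1(0)}^{(0)} + A_2 - 2w_1\left(A_{3(0)}^{(0)} + A_{4(0)}^{(0)}\right) + 2A_{5(0)}^{(0)}\right] \tag{3.17} $$

$$ MSE(t_9) = \bar{Y}^2\left[1 + w_1^2\left(A_{1(0)}^{(0)} + A_2 - 2A_{3(0)}^{(0)}\right) - 2w_1\left(A_{4(0)}^{(0)} - A_{5(0)}^{(0)}\right)\right] \tag{3.18} $$

$$ MSE(t_{10}) = \bar{Y}^2\left[1 + A_{1(0)}^{(0)} + A_2 - 2\left(A_{3(0)}^{(0)} + A_{4(0)}^{(0)} - A_{5(0)}^{(0)}\right)\right] \tag{3.19} $$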

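The MSEs of the estimators $t_i$, i = 2, 3, 4, 7, 8, 9, are minimised, respectively, for

$$ w_1^{*} = \frac{A_{4(0)}^{(0)}}{A_{1(0)}^{(0)}} \tag{3.20} $$

$$ w_1^{*} = \frac{A_{4(\alpha)}^{(m)}}{A_{1(\alpha)}^{(m)}} \tag{3.21} $$

$$ w_1^{*} = \frac{A_{4(\alpha)}^{(0)}}{A_{1(\alpha)}^{(0)}} \tag{3.22} $$

$$ w_1^{*} = \frac{A_2A_{4(0)}^{(0)} - A_{3(0)}^{(0)}A_{5(0)}^{(0)}}{A_2A_{1(0)}^{(0)} - \left(A_{3(0)}^{(0)}\right)^2}, \qquad w_2^{*} = \frac{A_{3(0)}^{(0)}A_{4(0)}^{(0)} - A_{1(0)}^{(0)}A_{5(0)}^{(0)}}{A_2A_{1(0)}^{(0)} - \left(A_{3(0)}^{(0)}\right)^2} \tag{3.23} $$

$$ w_1^{*} = \frac{A_{3(0)}^{(0)} + A_{4(0)}^{(0)}}{A_{1(0)}^{(0)}} \tag{3.24} $$

$$ w_1^{*} = \frac{A_{4(0)}^{(0)} - A_{5(0)}^{(0)}}{A_{1(0)}^{(0)} + A_2 - 2A_{3(0)}^{(0)}} \tag{3.25} $$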


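Thus the resulting minimum MSEs of $t_i$, i = 2, 3, 4, 7, 8, 9, are respectively given by

$$ \min MSE(t_2) = \bar{Y}^2\left[1 - \frac{\left(A_{4(0)}^{(0)}\right)^2}{A_{1(0)}^{(0)}}\right] \tag{3.26} $$

$$ \min MSE(t_3) = \bar{Y}^2\left[1 - \frac{\left(A_{4(\alpha)}^{(m)}\right)^2}{A_{1(\alpha)}^{(m)}}\right] \tag{3.27} $$

$$ \min MSE(t_4) = \bar{Y}^2\left[1 - \frac{\left(A_{4(\alpha)}^{(0)}\right)^2}{A_{1(\alpha)}^{(0)}}\right] \tag{3.28} $$

$$ \min MSE(t_7) = \bar{Y}^2\left[1 - \frac{A_2\left(A_{4(0)}^{(0)}\right)^2 - 2A_{3(0)}^{(0)}A_{4(0)}^{(0)}A_{5(0)}^{(0)} + A_{1(0)}^{(0)}\left(A_{5(0)}^{(0)}\right)^2}{A_{1(0)}^{(0)}A_2 - \left(A_{3(0)}^{(0)}\right)^2}\right] \tag{3.29} $$

$$ \min MSE(t_8) = \bar{Y}^2\left[1 + A_2 + 2A_{5(0)}^{(0)} - \frac{\left(A_{3(0)}^{(0)} + A_{4(0)}^{(0)}\right)^2}{A_{1(0)}^{(0)}}\right] \tag{3.30} $$

$$ \min MSE(t_9) = \bar{Y}^2\left[1 - \frac{\left(A_{4(0)}^{(0)} - A_{5(0)}^{(0)}\right)^2}{A_{1(0)}^{(0)} + A_2 - 2A_{3(0)}^{(0)}}\right] \tag{3.31} $$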


4. Empirical study 


The data for the empirical study are taken from the natural population data set considered
by Sukhatme and Sukhatme (1970):

y = number of villages in the circle, and
$\phi$ = a circle consisting of more than five villages.

N = 89, $\bar{Y}$ = 3.36, P = 0.1236, $\rho_{pb}$ = 0.766, $C_y$ = 0.6040, $C_p$ = 2.190,
$\lambda_{04}$ = 6.1619, $\lambda_{40}$ = 3.810, $\lambda_{12}$ = 146.475, $\lambda_{03}$ = 2.2744.


Table 4.1 gives the percent relative efficiencies (PREs) of the various estimators, computed with
respect to $\bar{y}$.




Table 4.1: PRE of different estimators of $\bar{Y}$ with respect to $\bar{y}$.

Estimator            PRE
$t_1 = \bar{y}$      100.00
$t_2$                101.41
$t_3$                 90.35
$t_4$                  6.92
$t_5$                 11.64
$t_6$                  7.38
$t_7$                100.44
$t_8$                243.39
$t_9$                243.42
$t_{10}$             241.98
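Some of the PRE values can be reproduced from the summary statistics alone. The sketch below does this for $t_1$, $t_2$, $t_5$ and $t_{10}$, whose first-order relative MSEs need only $f_1$, $C_y$, $C_p$ and $\rho_{pb}$. The sample size is not stated in this section; n = 20 is an assumption, chosen because it reproduces the tabulated PRE of $t_2$. The variable names are hypothetical.

```python
# Summary statistics quoted in the empirical study.
N = 89
rho_pb, C_y, C_p = 0.766, 0.6040, 2.190

# ASSUMPTION: the sample size is not given in the text; n = 20 is inferred
# because it reproduces the tabulated PRE of t2.
n = 20
f1 = 1.0 / n - 1.0 / N

# First-order MSEs divided by Ybar^2.
rel_mse = {
    # Sample mean: f1 * C_y^2.
    "t1": f1 * C_y ** 2,
    # Searls-type estimator at its optimal w1: 1 - 1 / (1 + f1 * C_y^2).
    "t2": 1.0 - 1.0 / (1.0 + f1 * C_y ** 2),
    # Naik and Gupta ratio estimator, from (1.2).
    "t5": f1 * (C_y ** 2 + C_p ** 2 - 2 * rho_pb * C_y * C_p),
    # Regression estimator: f1 * C_y^2 * (1 - rho_pb^2).
    "t10": f1 * C_y ** 2 * (1 - rho_pb ** 2),
}

# Percent relative efficiency with respect to the sample mean t1.
pre = {name: 100.0 * rel_mse["t1"] / value for name, value in rel_mse.items()}
for name, value in pre.items():
    print(f"{name}: {value:.2f}")
```

Under this assumption the computed values agree with the tabulated PREs of $t_2$, $t_5$ and $t_{10}$ to within about 0.01. The remaining estimators depend on the $\lambda$ terms and on choices of $\alpha$ and m that are not restated here, so they are left out.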


Conclusion 


The MSE values of the members of the family of estimators t have been obtained
using (2.6), and the corresponding PREs are given in Table 4.1. Table 4.1 shows the
superiority of the proposed estimators $t_2$, $t_7$, $t_8$, $t_9$ and $t_{10}$ over the usual unbiased
estimator $t_1$, the estimators $t_3$ and $t_4$, the Naik and Gupta (1996) estimator $t_5$ and the
Singh et al. (2008) estimator $t_6$. From this result we can infer that the proposed estimators
$t_8$ and $t_9$ are more efficient than the rest of the estimators considered in this paper for
this data set.

We would also like to remark that the PRE of $t_{10}$, whose MSE equals that of the
regression estimator, is 241.98. From Table 4.1 we notice that the PREs of the estimators
$t_8$ and $t_9$ (243.39 and 243.42) exceed this value, that is, their MSEs are smaller.
Finally, we can say that the proposed estimators $t_8$ and $t_9$ are more efficient than the
regression estimator for this data set.